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Summary 


-The  inverse  Gaussian  or  the  first  passage  time  probability  distribution  for 
Brownian  motion  with  a  drift  is  particularly  important  for  modeling  and  inter¬ 
preting  observed  distributions  of  time  intervals  in  many  different  fields 
of  research.  In  this  paper  we  deal  with  the  problem  of  selecting  a  subset  of  k 
inverse  Gaussian  populations  which  includes  the  "best" population,  i.e.  the 
(unknown)  population  which  is  associated  with  the  largest  value  of  the  unknown 
means.  The  shape  parameters  of  the  inverse  Gaussian  distributions  are  assumed 
to  be  equal  for  all  the  k  populations.  When  the  common  shape  parameter  is 
known,  a  procedure  R^  is  defined  and  studied  which  selects  a  subset  which  is 
nonempty,  small  in  size  and  just  large  enough  to  guarantee  that  it  includes  the 
best  population  with  a  preassigned  probability  regardless  of  the  true  unknown 
values  of  the  means.  For  the  case  when  the  common  shape  parameter  is  unknown  a 
procedure  R^  is  proposed.  For  the  procedures  R^  and  R,),  we  obtain  exact  results 
for  k=<?  concerning  the  infimum  of  the  probability  of  a  correct  selection.  For 
k  >  3  a  lower  bound  on  the  probability  of  a  correct  selection  is  derived  for  each 
case.  Formulas  for  the  constants  d-j  and  d^  which  are  necessary  to  carry  out  the 
procedures  R1  and  R^,  respectively,  are  obtained.  An  upper  bound  on  the  expected 
number  of  populations  retained  in  the  selected  subset  is  given., — - 

If  the  best  population  is  defined  as  the  one  associated  with  the  largest  shape 
parameter,  it  is  shown  that  with  a  suitably  chosen  statistic,  this  problem  concides 
with  the  problem  of  selecting  a  subset  of  k  normal  populations  which  includes 
the  population  with  the  smallest  variance.  Similarly,  for  the  selection  of  a 
subset  containing  the  smallest  shape  parameter,  the  problem  reduces  to  selection 
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1 .  Introduction,  Basic  Concepts  and  Notation 

The  inverse  Gaussian  or  the  first  passage  time  probability  density 
function  (p.d.f.)  for  Brownian  motion  with  a  drift  is  particularly  important 
for  modeling  and  interpreting  observed  distributions  of  time  intervals  in 
many  different  fields  of  research.  For  example,  Hasofer  (1964)  considered  the 
inverse  Gaussian  model  for  the  emptiness  of  dam,  Marcus  (1976,  1976)  used  it 
in  communications  noise  and  highway  noise  models,  Banerjee  and  Bhattacharryya 
(1976)  applied  it  in  a  study  of  purchase  incidence  models,  Chhikara  and  Folks 
(1977)  studied  it  in  reliability  and  life  testing,  among  others.  Also  the 
statistician  often  finds  himself  dealing  with  data  of  considerable  skewness 
with  no  obvious  choice  of  distribution  suggested  by  physical  consideration. 

In  such  cases  the  choice  is  always  made  upon  the  basis  of  goodness-of-f it 
and  upon  the  ease  of  working  with  the  chosen  distribution.  Because  of  the  ease 
due  to  the  exact  sampling  distribution  theory  of  the  inverse  Gaussian  it  would 
appear  to  be  a  strong  candidate  in  such  cases  and  for  this  reason,  Chhikara  and 
Folks  (1977)  suggested  that  the  use  of  the  inverse  Gaussian  over  the  lognormal 
would  be  perferable. 

The  probability  distribution  of  the  first  passage  time  in  Brownian  motion 
with  a  drift  was  first  derived  by  Schrodinger  (1915).  Tweedie  (1956,  1957a, 
1957b)  studied  the  properties  of  it  and  proposed  the  name  inverse  Gaussian 
distribution  for  it.  This  distribution  is  also  known  as  Wald's  distribution 
(cf.  Wald  (1947)). 

*  This  research  was  supported  by  the  Office  of  Naval  Research  contract 
N0014-75-C-0455  at  Purdue  University. 


In  this  paper,  we  consider  the  problem  of  selecting  a  nonempty  (small)  sub¬ 
set  of  k  different  inverse  Gaussian  populations  which  contain  the  "best" 
i.e.  the  population  which  is  associated  with  the  largest  unknown  mean  or  the 
distribution  which  is  associated  with  the  largest  shape  parameter. 

The  inverse  Gaussian  distribution  has  two  parameters  with  p.d.f.  expressed, 
in  two  alternative  parametizations ,  as 


g(x,  \j,  <j  ,  a) 


exp 


o /2  n  x3  1 

=  0,  otherwise. 


( a  - vx ) ‘ 
_2,2  x 


x»viO )  &  s  0 


(1.1) 


and 


f (x ;  u,a)  =  ( — a  exp  |-  |,  x,m,a  >  0  (1.2) 

2ttxj  /  2u  x  1 

=  0,  otherwise. 

Expression  (1.1)  is  convenient  for  interpretation  in  terms  of  Brownian 

motion.  Suppose  W(x)  is  a  Brownian  motion  (Wiener  process,  see  Cox  and  Miller 

2 

(1965))  with  drift  v  and  variance  parameter  o  ,  i.e.  a  stochastic  process  with 
the  following  properties: 

(a)  W(0)  =  0  a.e.  and  W(x)  has  independent  increments; 

(b)  for  any  time  interval  (x-j  ,x^) ,  W(xg)-W(x^ )  is  normally  distributed 

2 

with  mean  v(x^-x^)  and  variance  o  (Xg-x^),  then  formula  (1.1)  gives  the  p.d.f.  of 
the  first  passage  time  X  of  W(x)  with  positive  drift  v  to  barrier  a  0  . 

Expression  (1.2)  is  useful  for  deriving  some  results  which  are  parallel  to 
that  of  the  usual  normal  distribution.  It  is  known  that  the  parameter  u  is  the 
mean  and  *  is  a  shape  parameter. 
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From  (1.1)  and  (1.2)  it  is  easy  to  see  that  the  relations  u  =  a/v  and 
2  2 

A  =  a  /0  hold.  Therefore,  comparing  k  inverse  Gaussian  means  p/s  is 
equivalent  in  some  sense  to  comparing  the  associated  drifts  of  Brownian  motion. 
Note  that  the  inverse  Gaussian  distribution  is  a  member  of  the  exponential  family. 
From  now  on  we  will  use  (1.2)  to  formulate  our  problem. 

For  a  random  variable  X  distributed  according  to  (1.2),  we  denote 
X  'v-  I(p,x).  For  this  distribution  a  is  a  shape  parameter,  the  mean  is  |j  and  the 
variance  is  u  /A.  If  X-|,...,X  is  a  random  sample  from  I(u.a),  Schrodinger 
(1915)  showed  that  the  maximum  likelihood  estimates  of  u  and  \  are  given  by 

n 

p  =  X  and  A  =  n/  J  ( 1 / X .  -  1/X), 
i  =  l 

where 


X  =  l  X./n. 
i=l 

n  5, 

Tweedie  (1957a)  proved  that  X  I(p,a),  A  ^(l/X.-1/X)  ■».  x„.i »  the  chi-square 

distribution  with  n-1  degrees  of  freedom  and  that  they  are  stochastically  inde¬ 
pendent.  The  statistics  X  and  rO/X  -  1/X)  jointly  are  sufficient  and  complete 
for  (u ,\) ,  and  X  is  a  complete  sufficient  statistic  for  v  if  a  is  known. 

Let  7t -j . nk  be  k  independent  inverse  Gaussian  populations  with  means 

u^,...,Mk  and  shape  parameters  \^,...,Ak,  respectively.  Let  upj 

be  the  ordered  p .  's.  It  is  assumed  that  there  is  no  prior  knowledge  of  the 
correct  pairing  of  the  ordered  and  the  unordered  u^'s.  Let  ,  j=l,...n-, 
i  =  l,...,k  be  independent  samples  for  n-| ,. . .  ,i'k ,  respectively,  and  let 


Xi 


ni 


=  l  X.  ./n. ,  i  =  l ,. . .  ,k  denote  the  sample  means. 
j=l  1J  1 


Let 


X(i)  and  n(i) 


denote 


the  sample  mean  and  sample  size  associated  with  the  unknown  population 
ti^.j  with  mean  i=l»...»k. 

Given  any  P*,  1/k  ■  P*  <  1 ,  our  goal  is  to  select  a  subset  of  these  k 
populations  such  that  the  subset  contains  the  best  population  with  probability 
at  least  P*,  no  matter  what  the  true  configuration  of  p^'s.  Selection  of  a  subset 
which  contains  the  best  population  is  called  a  correct  selection  and  is  denoted 
by  CS.  Therefore,  we  are  interested  in  defining  (and  studying)  a  selection 
procedure  R  such  that 

inf  P  (CS i R)  >  P*  (1.3) 

- 

where  is  is  the  set  of  all  k- tuples  (u-j  ». .  •  »u^) »  u  ^  >  0,  i=l > . . . ,k.  This  require¬ 
ment  will  be  referred  to  as  the  P*-condition.  In  Sections  2  and  3,  we  discuss 
the  cases  of  known  and  unknown  common  shape  parameter  x,  respectively.  For 
each  case,  a  conditional  selection  procedure  is  proposed  and  studied.  In 
Section  4,  the  problem  of  selecting  a  subset  which  contains  the  larges  shape 
parameter  is  considered.  It  is  snown  that  with  a  suitably  chosen  statistic  this 
problem  is  equivalent  to  the  problem  of  selecting  a  subset  of  k  normal  populations 
which  includes  the  population  with  the  smallest  variance.  In  other  words,  the 
problem  of  selecting  the  inverse  Gaussian  population  with  the  largest  (smallest) 
shape  parameter  reduces  to  the  problem  of  selecting  the  gamma  population  with 
tne  smallest  (largest)  scale  parameter. 

2 .  S  e 1 ec  tion  o  f  the  I n  v  er s  e  Gau  s  si  an  Popu lation  with  the  Largest  Mean  When 
>  .  =  a,  i =1 ,. . . ,k  is  Known 

2.1.  A  Conditional  Selection  Procedure  R] 

When  the  common  shape  parameter  is  known,  we  propose  the  following 
conditional  selection  procedure  R] : 


R-j :  Select  the  population  n .  if  and  only  if 

X.  ■>  max  X.  -  d. ( t) ,  given  T  =  7  X..  =  t  , 

1  l<j<k  J  1  i , j  1J 

where  t  >  0  and  d^(t)  is  the  smallest  positive  value  to  satisfy  the  P*-condition. 

It  is  known  that  for  two  independent  random  samples  X^,...,X^n,  from 
I(u^,a)  and  X2l"‘',X2n  from  *(^2’^’  t^ie  j°lnt  constitutes  a  three- 

parameter  exponential  family  and  may  be  written  in  the  form 
exp( ipt  +  eu  +  nv) , 

where 


ip  -  -  x(n-|U^  ^  ^^(n-j+ng), 

-2  -2 

o  -  -  A  ( u -j  —  ^2  )  ^  1  ^2/^ ( n  1  "^"02 )  ’ 

n  =  -  A/2, 


(2.1) 


and  t,  u,  v  denote  the  values  of  the  statistics 
nl  n2 

T  *  1  x!i  +  l  x!i  • 

i=l  11  j=l 

n . 

U  =  X,  -  X-,  where  X.  =  l  X.  /n . ,  i 
1  c  1  *=1  n  1 

nl  1  n2  1 

v  =  X  Vi  +  1  xi]  ■ 


and 


i  =  l 


j  =  l 


=  1,2, 


respectively. 

For  k  =  2,  the  following  theorem  gives  us  an  exact  result. 
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Theorem  2.1.  For  a  given  P*,  1/k  •  P*  •  1,  k  =  2,  let  d  1  ( t )  be  the  smallest  value 
such  that 

v,0  (h  -  \  -  d,(t)iT-o  ■  p* 

where  =  Ly  t  •  .=uk  >  0)  . 


Then,  inf  P  (CSjR)  =  P  „  (CS|R,)  =  P*  . 

M  U  t  iin  'I 

Jitii  —  ■  u 

Note  that  the  infimum  of  P(CSjR)  does  not  depend  on  the  common  value  of  u^Wn^. 


Proof:  Since  x  is  known,  the  joint  p.d.f.  of  X^,...,X^  and  X21”"’X2n 
belongs  to  a  two-parameter  exponential  family.  It  follows  from  an  argument  similar 
to  that  in  Lehmann  (1959,  p.  136)  that 


ph<CS'R)  ■  ’Vo  (*<1)  -  *<2)  -dl(t>lTn> 

-P6=0  <*(1)  -  *(2)  -  d,U)iT-t) 

*  • 

Hence  tnf  P  (CS;R.)  *  P  (CSIR.)  *  P*  . 

M  I  II  t  '-in  1 

Pt-  "  L) 


Lemma  2.1.  If  two  random  variables  Z  and  X  are  independent  of  another  random 
variable  Y,  and  if  the  just  p.d.f. ‘s  f'z  x  and  f^  x+y  exist,  then 

c*> 

fz  XfY(z.t)  =  J  fz  x(z  ,x)fy(t-x)dx  ;  (2.2) 

t 

=  /  f  (z,x)fY(t-x)dx,  (2.3) 

0  L ,A 

if  both  random  variables  X  and  Y  take  only  positive  values. 


Proof:  The  proof  is  straight  forward  and  hence  is  omitted. 

For  k  >  3,  based  on  the  Bonferroni  inequalities  and  Lemma  2.1,  we  drive 
a  lower  bound  on  the  probability  of  a  correct  selection  in  Theorem  2.2. 
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t,  let 


Theorem  2.2.  For  k  3,  and  given  P*,  1/k  •  P*  1  and  7  }’  X-. 

1  -P*  ( j )  1  ’  J 

P*  =  1  -  k_y  and  let  d^'(r)  be  the  smallest  value  such  that  for  any 

pfcU0  =  1m|u1  =  •  •  •  =  uk  =  v  01,  and  any  ij.j^i. 


p„(uij  <  c)  i  hj  =  r>  =  i 


(2.4) 


where 


U.,  =  X,M  -  X,-, 

U  0)  (J) 

TU  =  n(i)  \i)  +  "(j)  *(j)’  1  -  *  *  j  -  k' 


d-|(t)  =  max  {d\j  (r)  1 1 1  i  i  j  <  k,  0  <  r  y  t} 


(2.5) 


inf  P  (CS|R,)  >  P* 

V*  * 


Proof:  For  all  u£u 


P^CSI^) 

=  P  (L,,  >  max  Xm  -  d, ( t)  | T=t) 
if  ^  l<j<k-l  Uj  1 

=  1  -  P  <  max  Xm  -  d1(t)(T=t) 

H.  1  < j<k- 1  m  1 


>  2  -  k  +  J  P  (U-,  <  d,  (t)  |T=t) 
j=l  if  Jk  “  1 


(2.6) 


For  any  j,  lJ-k-1,  using  Lemma  2.1  we  have 
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p„  <ujk  ~  dl<t)iT  -  t) 
t 

=  /  P(Uik  £  (1 1  (t)  JT  =r)  f  (r).f  (t-r)dr/fT(t) 

0  Jk  1  Jk  1  j  k  T'Tjk  T 

£  /  P(U,k  £  d  P  ( r )  1 T - k  =  r)  f  ( r ) - f  (t-r)dr/f_(t) 
0  Jk  Jk  jk  Tjk  T_Tjk  T 

ipT  • 


Hence  inf  P  (CSjR,)  -  2-k  +  (k-1)  P*  =  P*  . 
.  .  v  I  -  i 


2.2.  Evaluation  of  Values  of  d^(t)  for  the  Procedure 


For  two  independent  random  samples  X-|p...,X.|  from  I  ( u  ^ ,  A )  and  X21,',,,X2n2 

from  I(u21a),  let  T  =  [  X, .  +  l  X?.,  then  it  follows  from  Tweedie  (1957a)  that 

i  j  3 

T  k-  I((n^+n2)u,  (n-j+n^)^  a)  if  npupu-  Chhikara  (  1975)  derived  the  conditional 
p.d.f.  g(ujt)  of  U  X,  -  X2,  given  T  =  t,  upp,  as 


g(u,t)  = 


"  n]n2(n1+n2)2  At3 

1/2 

2  2  i 

nln2^nl+n2^  >u 

_  u  )3(t-np)3  _ 

exp 

2t(t+n2u)  (t-np) 

(2.7) 


By  using  the  one  -  to-one  transformation 

(ryi^O-  (i^+n^u 
[t(t  +  n2u)(t-n1u)>- 


(2.8) 


b 


1 


it  can  be  shown  that  the  P*-percenti 1 e  point  i(P*)  i(P*  ,  n^ ,  n2>  t)  of  (J, 
given  T=t  i.e.,  the  solution  of  the  equation 

i  ( P* ) 

/  g(u | t ) du  =  P* 


is  given  by  the  following  equation 


n2'nl 


/2n,n;,X 

't(d0(t))  +  ny+riT,  exp(  t~~" 


where 


P*  , 


(2.9) 


dQ(t)  =  i(P*)(n]+n2)  [n^  x/ t(t+n2  i(P*))(t-n1  i(P*)>  , 


(2.10) 


and  <i>  is  the  cumulative  distribution  function  (c.d.f.)  of  a  standard  normal 
distribution. 

When  n]  =  n2  =  n>  the  equation  (2.8)  will  simplify  to  d^(t)  =  z(P*),  the 
P*-percentile  point  of  the  standard  normal  distribution.  Hence  we  have 


i(P*)= 


z3(P*)t3 _ 

4n  x+z3(P*)tn^  J 


(2.11) 


which  is  increasing  in  t,  if  n  is  fixed.  Note  that  i(p*)  -*■  0  as  n  •*  ■  if  t  =  0(n). 


Corollary  2.1.  For  k=2,  the  constant  d-j(t)  associated  with  the  procedure  is 
given  by 

d 1 ( t )  =  1(P*),  (2.12) 

where  i(P*)  is  given  by  (2.9)  or  (2.11). 


Corollary  2.2.  For  k  >_  3,  the  constant  d^(t)  associated  with  the  procedure 
is  given  by 

d^t)  =  max(  i  (Pt,  n^  ,  r)  |  1  i  f  j  •  k,  0  •  r  ■_  tl 

=  i(P|,  n,  n,  t),  if  =...=  n^  =  n.  (2.13) 
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2.3  An  Uoper  Bound  on  the  Expected  Subset  Size  and  Other  Properties  of  Procedure  R] 


for  any  given  values  of  k  and  P*.  the  size  of  the  selected  subset  S  by  using 


the  procedure  is  a  function  of  the  true  configuration  pT(i 


r  ■  ■ 


,)  and  it 


also  depends  on  n^,...,n^.  Note  t ha t  S  is  an  interger-valued  random  variable  which 
takes  values  1  to  k  inclusive.  Hence,  (in  alalogy  with  power  of  the  hypothesis 
testing  problem)  Ea  (SjR^)  can  be  looked  upon  as  a  measure  of  the  efficiency  of 
the  procedure  R^ .  We  now  discuss  how  to  evaluate  it.  We  consider  the  space  of 
all  si  ippage  configurations  of  the  type  j  =  . . .  =  p  j-  k_  ^  j=u  and  A  -  1;  and 

we  denote  this  space  by  i:(a).  We  also  assume  that  n^  =  n^ 

Then,  for  any  }1lu(a),  the  expected  size  of  the  selected  subset  is 

V 


:n.  =  n. 
k 


E^  (S|Rt)  =  P,  (X(k)  i  ^  max  ^  X(j )-6] (t) |T=t) 

>  max  X/- 


*  <k-»  p„  <*(i>  - 


-f,  (xo)  •  V> 


m 


2<_j<  k 

.  d1 (t) !T=t) 


d, (t) |T=t) 


+  (k'])  P,.(*(k)  •  *(l)  •  di<t)  tT=t) 


t/n  x+d,(t) 
1  +  (k-1)  /  /  1 


0 


0 


fX(k)|T/n^it/n)  fXn,|T/n(x't/n)d^dx 


(2. 14) 


0) 


where 


t/n 

fX(])|T/n(xit/n>  =  j  f-X(1),T/n-X(k)  i* t-r)dr/fT/„( t/n) , 
fX(k)|T/n(x|t/n)'fX(k)(x):T/n-X(k)/fT/n  (t/n)' 


and 


t/n 

fT/n(t/"W  fT/n-S,k)'"'X(k) 


( r ) f 5  (t/n-r)dr. 
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Note  that  fT/  ^  (•)  is  the  p.d.f.  of  I((k-l)n,  n(k-l)^x). 

T/n-X(k) 

Rema rk :  The  density  function  of  statistic  T  in  s ;(■'■)  or  in  any  other  non- 
homogeneous  space  is  difficult  to  evaluate  in  an  exact  form.  One  of  the 
reasons  for  such  difficulty  is  that  the  inverse  Gaussian  random  variables 
have  only  restrictive  additive  property  as  explained  below  (see  Chhikara 
and  Folks  (1975)).  We  know  that  if  Xj,X2,...,Xk  are  independent  inverse 

2 

Gaussian  variables  with  parameter  and  x ^ ,  then  xXi.'vI(iu^  ,F,(lu.j )  ) 

2 

if  and  only  if  x./u^  =5  for  all  i.  The  sufficient  part  was  shown  by 

Tweedie  and  the  necessary  part  was  given  by  Chhikara  (1972)  and  Shuster 
and  Miura  (1972) . 

Let  _X=  ( X]  i .  ,X-|n  ,X?1 .  ,X2n  . .  ,Xkl . .  ,Xkn  )  be  a  random  vector 

12  k 

then  X=x£IrN,  where  N=n-|  +  ...+nk.  A  selection  rule  can  be  denoted  by 

tf(x)  =  (cp1  (x) . cpk  (x)),  where  <p  (x]_:  1R  N->L0 , 1  ]  is  the  probability  that 

tt is  included  in  the  selected  subset  when  X=x  is  observed.  Similarly, 

T  t  T 

a  conditional  selection  rule  can  be  denoted  by  v”  (X)=(v^  (x),...,c,~  (x)), 
where  cf T  ( x )  is  the  conditional  probability  that  n-  is  included  in  the 
selected  subset,  given  T=;t,  when  X=x  is  observed.  It  is  easy  to  see  that 
T=T,  v  ■  (xj=0  or  1  and  (x)^0  when  rule  R-j  is  used. 


M 

Definition:  A  selection  rule  is  scale  invariant  if  for  every  xtF  ,  for 
every  real  number  c-0  and  for  every  i  =  l,...,k,  c,i  .  (cx)  =  g  .  (xj . 

We  may  define  scale  invariance  for  conditional  selection  procedures 
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1 


in  a  similar  way,  then  we  have  the  following  theorem: 

Theorem  2.3.  With  equal  sample  size,  the  procedure  with  constant  d-j(t) 
given  by  Theorem  2.1.  or  Theorem  2.2.  is  scale  invariant. 

Proof:  From  Corollary  2.1.  and  Corollary  2.2.  we  see  that  the  constants 
d^(t)  given  by  Theorem  2.1.  or  Theorem  2.2.  have  the  property 

d-|  (ct)=cd^{t)  for  all  c>0, 

so  we  have 

*J(C-)  (cx)  =  9.  T(x)  for  all  c>0,  and  i  =  l,...,k, 
hence  the  theorem  is  proved. 

2.4  Applications  to  a  Test  of  Homogeneity  for  .  .=u^ 

When  the  common  shape  parameter  X  is  known,  for  the  problem  of  the 
test  of  homogeneity  of  k  inverse  Gaussian  populations,  i.e.  the  test  of 
hypothesis : 

null  hypothesis  :';l=il2=‘ ‘  "=,Jk 

versus  are  not  all  equal, 

we  propose  the  following  conditional  procedure  (at  level  u)  y(T). 

The  procedure  y(T)  is: 

The  null  hypothesis  is  rejected  if  and  only  1f  *[k]'*[l]'dl(t)’ 

given  T~t,  where  d^(t)  is  given  by  (2.12)  or  (2.13)  with  P*=l-u/k. 

It  is  easy  to  see  that  the  procedure  v ( t )  has  the  probability  of 
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of  type-one  error  less  than  a,  since  under  null  hypothesis 


=  ^^[1  j<^[k]"^l  ^ ^  i  ”^=^) 

=  P(Xj<5?j-k-j-d^(t)  for  some  j  ! T=t ) 
k 

I  l  iP(Xj<X[k]-d1(t)|T=t) 

=  k- kP( 5?-j >5?|~k -j-d-j ( t )  |T=t) 

=  k-kP* 


=  a. 

For  k=2,  it  has  been  shown  that  the  procedure  y(t)  is  an  UMP  unbiased  test, 
see  Chhikara  (1975)  (also  Lehmann  (1959)). 

3.  Selection  of  the  Inverse  Gaussian  Population  with  the  Largest  Mean 
when  the  Common~SFape  Parameter  A  is  unknown"  I 

k  ni  ^ 

With  the  same  notations  as  before  and  let  V  =  V  J  X... 

i=l  j  =  l  1J 


3.1.  A  Conditional  Selection 

Rg.'  Select  the  population  if  and  only  if 

)(.>  max  X.-d„(t,v),  given  T=t,  V=v 
1_l<j<k  J  6 

where  t>0,  v>0;  d2(t,v)  is  the  smallest  positive  values  chosen  to  satisfy 


the  P*-condition. 

For  k=2,  we  have  the  following  theorem: 
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Theorem  3.1.  Given  l  <  P*  <  1,  k  =  2,  T  =  t  and  V  =  v,  let 

1 

^  L,1|n2(n1fn2-2)>  (n^n^Ju 

i[tv-(n1+n2)2](T+n2u)(T-n1u)}^ 

n]n2(n1+n2)2  u2  1 

[tv-(n1+n2)2](t+n2u)(t-n1u) 

where  , 

(n]n2)-(n]+n2)u 

_]  <  —  2  j- 

i[tv-(n1+n2)  ](t+n2u}(t-r.1u)]s 

and  let 

d2(t,v)  =  h_1 (c(t,v)) , 


(3-1) 


(3.2) 


where  the  constant  c  c(t,v)  is  determined  by 

?(n1+n?-3)/2 

n  _  n  r  1-,,  /„  ~  1  <- 


n?-n,  I"  tv-(n,-n?)t" 

Ht,n1+ri-2^c^  +  n7+rT"  ~  7  7~72 

12  12  Ltv-(n1+n2) 


(’-"t;n1+„3-2  «='))- 


where 


c'  =  {c2  +  4n1n2(n1+n2-2)/[tv-(n]-n2)2]r 


and  H|.n  +n  _2  denotes  the  c.d.f.  of  students's  t-distribution  with  n-j+n2-2 
degrees  of  freedom.  Then  inf  P(CS|R„)  =  inf  P(CS|R?)  =  P*. 

i?  11  ;0 

Proof:  For  fixed  t  and  v,  h(u)  is  a  monotone  nondecreasing  function  in  u,  hence 
h  '  exists  and 

h'](w)  =  {(n2-n1)  ty2  +  tyC^+n^  y2  +  4]*}/2( l+n^y2)  (3.4) 
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9  i  i  2  1 

where  y  =  w[tv-(n^+n^,)  ^'/(n-in-)-  (n^+n^)  [n^+n^-Z+w  ]• 

With  the  same  notations  as  that  in  Section  2,  it  follows  from  an  argument  similar 
to  that  as  in  Lehmann  (1959)  [see  P.136]  that 

inf  P  (CS|RJ 

i2  —  C 

=  inf  P(j  (X(1)-X(2)  •.  d2(t,v)|T=t,  V=v) 

=  Po=0  ^(1)^(2)  d2(t'v)  lT=t‘  V-v) 

P6=0  (h(U)  <  h(d2(t,v))|T=t,  V=v) 

Pe=0  (h(U)  <  c(t,v)jT=t,  V=v) 

=  P*, 

by  the  definition  of  c(t,v)  (see  Chhikara  (1975),  p.bl). 


Corollary  3.1.  In  Theorem  3.1,  if  we  have  a  common  sample  size,  say  n^n^n, 
then  the  constant  c  is  determined  by 


Ht;2n-2  ^c) 


P*  i.e.  c  -  H;!2n_2  (P*). 


(3.5) 


Thus  c  is  given  by  the  P*-percenti 1 e  of  a  t-distribution  with  2n-2  degrees  of 
freedom.  Consequently, 


d2(t,v)  = 


tC( tv-4n^)^ 
n[c^tv+4n^(2n-2)]1J 


(3.6) 


which  is  increasing  in  t  and  v,  if  n  is  fixed.  Mote  that  d?(t,v)  •  0  as  n  «■  ■ 
if  both  t  =  0(n)  and  v  =  0(n). 

Similar  to  Theorem  2.2.,  the  following  theorem  gives  a  lower  bound  on  the 
probability  of  a  correct  selection  in  case  of  k  -  3  where  the  common  shape 
parameter  x  is  unknown. 
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Theorem  3.2.  For  k  >  3,  given  P*,  1/k  <  P*  •  1,  T=t,  V=v,  suppose  the  common 

1-P*  (?) 

shape  parameter  A  is  unknown.  Let  P|  =  1  -  k_—  and  let  .  '  (t,v)  be  the 

smallest  value  such  that  for  any  =  iu|  up. ..  =uk  0} 


(Uj(  ■  dj21  (r,s)|Tf..r.  - 


■a  1J  1J 


ij 


P* 


where 


Uij  =  X0)  -  X(J)  * 


Tij  =  n(i)X(i)  +  n(j)X(j)  5 


and 


"(i) 


Vid  =  l 


n(j) 


p.=l 


C(u)  +  j,  X(JO 


,(2) 


1  <  i  t  j 


Let  d„(t,v)  =  max{d'.v  (r,s)|l  <_i  -f  j  <  k,  0  r  <_  t,  0  <  s  <  v}. 

I—  I  y) 

Then  inf  P(CS  [R2)  :  P*. 

u 

Proof:  Proof  is  almost  the  same  as  the  proof  of  Theorem  2.2.,  hence  it  is  omitted. 

Corol  lary  3. 2.  In  Theorem  3.2.,  if  =n^ . . .  =  nk=n  then  d2(t,v)  is  given  by  (3.7) 

- 1  * 

with  c  H,  .,( P  i ) ;  note  that  the  procedure  FL  is  scale  invariant, 
t  ;2n-2  I  2 


4.  Selection  From  Inverse  Gaussian  Populations  in  Terms  of  the  Shape  Parameters 

In  ranking  inverse  Gaussian  populations  in  terms  of  their  shape  parameters,  we 
defined  the  best  population  as  the  one  associated  with  With  the  same 

assumptions  as  given  in  Section  1,  for  all  i  -  l,...,k  let 

ni  (x..-,,.)2 

'  it  '  i  1 


s2  -  _L 
s-  l 


~  i  - — - - ,  if  is  known, 

1  a-  J=1  X4S  1 


(4.1) 


ij 


l  (xl  '  ) 

j  =  l  U  X.  ' 


,  if  it,  is  unknown. 


(4.2) 


1 6 


2  2 

then  A . S .  nas  a  chi-square  distribution  ,  with  v.  degrees  of  freedom  where 

11  V..  1 

v ^  =  n^  or  n.-l  depending  on  the  case  whether  n .  is  known  or  unknown.  Therefore, 

there  is  no  need  to  deal  with  the  cases  of  known  or  unknown  means  separately. 

2 

Using  statistics  S.  ,  i=l,...k,  the  problem  of  selecting  from  inverse 
Gaussian  populations  in  terms  of  shape  parameter  is  equivalent  to  the  problem 
of  selection  from  normal  populations  in  terms  of  variances  (see  Gupta  and 
Panchapakesan  (1979)). 

Foranequal  sample  size  case,  parallel  to  the  rule  of  Gupta  and  Sobel  (1962a), 
we  propose  a  rule  R^. 

R^:  Select  population  « .  if  and  only  if 

s2  r'1  s2 

si  '  c  b[i]  ’ 

where  0  <  C  _  C(v,k,P*)  ^  1  is  determined  so  that  the  P*-condition  is  satisfied. 
Here  =  {a_|a^  >  0).  It  is  easy  to  see  that  the  infimum  of  P ( CS  |  )  occurs  when 

X[l]  =  *[2]  =‘,-=  A[k]  and  1S  dependent  °f  the  common  value.  Thus  we  have 


inf  P  (CS|RJ  =  /  [l-x^(cx)]k_1  dxf(x) 

12  --  J  0  v 


(4.3) 


and  also  we  have  sup  E^( S | R^ )  =  kP*  • 

U 

The  c-values  can  be  found  in  Gupta  and  Sobel  (  1  962b)  for  k= 2 ( 1 )  11,  =2(2)  50 

and  P*=0.75,  0.9,  0.95  ana  0.99. 

For  an  unequal  sample  size  case,  some  results  are  available  in  Gupta  and 
Huang  (1976)  [see  also  Gupta  and  Panchapakesan  (1979)]. 


Remark  4. 1 .  Let  X . 


,[(j: 


be  the  harmonic  sample  mean  of 


tt and  let 


Si  =  *iH  +  “lh  ^  known , 


=  ,  if  i  .  is  unknown. 
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2  2 
then  using  the  statistic  is  equivalent  to  using  the  statistics  ,  i=l,...k, 

since  =  n^S^  for  all  i . 


Remark_4^2.  It  should  be  pointed  out  that  the  problem  of  selecting  the  inverse 


Gaussian  populations  in  terms  of  is  equivalent  to  the  problem  of  selecting 

from  gamma  populations  with  densities  — j--j 


-x/i) .  v-1 

e  1  ^ 


those  that  have  large  values  of  e. .  This  problem  has  been  solved  in  Gupta 
(1963),  where  appropriate  tables  are  also  provided. 
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and  studied  which  selects  a  subset  which  is  nonempty,  small  in  sice  and  just 
large  enough  to  guarantee  that  it  includes  the  best  population  with  a  preassigned 
probability  regardless  of  the  true  unknown  values  of  the  means,  for  the  case  when! 
the  common  shape  parameter  is  unknown  a  procedure  R0  is  proposed.  for  the  proce¬ 
dures  Rj  and  R.,,  we  obtain  exact  results  for  k  =  2  concerning  the  inti  mum  ol  the 

probability  of  a  correct  selection.  for  k  '  3  a  lower  bound  on  the  probability 
of  a  correct  selection  is  derived  for  each  case.  formulas  ten'  the  constants  d^ 

and  d^  which  arc  necessary  to  carry  out  the  procedures  Rj  and  R_,,  respectively, 

are  obtained.  An  upper  bound  on  the  expected  number  of  populations  retained  in 
the  selected  subset  is  given 

If  the  best  population  is  defined  as  the  one  associated  with  the  largest 
shape  parameter,  it  is  shown  that  with  a  suitably  chosen  statistic,  this  problem 
coincides  with  the  problem  of  selecting  a  subset  of  k  normal  populations  which 
includes  the  population  with  the  smallest  variance.  Similarly,  for  the  selec¬ 
tion  of  a  subset  containing  the  smallest  shape  parameter,  the  problem  reduces  to 
selection  in  terms  of  the  largest  scale  parameter  of  the  gamma  distributions. 
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